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Abstract 

We introduce a novel functional time series methodology for short-term load forecasting. The 
prediction is performed by means of a weighted average of past daily load segments, the shape 
of which is similar to the expected shape of the load segment to be predicted. The past load 
segments are identified from the available history of the observed load segments by means of their 
closeness to a so-called reference load segment, the later being selected in a manner that captures 
the expected qualitative and quantitative characteristics of the load segment to be predicted. Weak 
consistency of the suggested functional similar shape predictor is established. As an illustration, 
we apply the suggested functional time series forecasting methodology to historical daily load data 
in Cyprus and compare its performance to that of a recently proposed alternative functional time 
series methodology for short-term load forecasting. 
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1 Introduction 

Load forecasting is an integrable process in the design of power systems faced by electricity authorities 
worldwide, involving accurate predictions of electric load over different time periods in the furure. It 
can be broadly classified as short-term, medium-term and long-term forecasting, in terms of planning 
time horizons. Although different planning time horizons for these categories seem to exist in the 
literature, we have adopted herein the load forecasting classification scheme of Srinivasan and Lee 
(1995), viz., up to 1 day for short-term load forecasting (STLF), more than 1 day up to 1 year for 
medium-term load forecasting (MTLF), and more than 1 year up to 10 years for long-term load 
forecasting (LTLF). See also Alfares & Nazeeruddin (2002). 

Competition, the need of saving raw materials that are used in the production of electrical energy, 
the reduction of emissions and the avoidance of money wasting in general, are the main reasons 



that force electricity authorities worldwide to proceed to a better planning for the production of 
electricity. The main characteristics of the programming are the quantity of electrical energy that 
needs to be produced and the type of machine that is going to be used. This can be achieved by 
requiring accurate STLF, an important category of load forecasting, that plays a major role in real- 
time control and security functions for designing larger power systems, see, e.g., Srinivasan (1998) 
and Laurent et al. (2007). Additionally, the particular characteristics of the electricity production 
process are essential for optimal planning of daily power generation. In connection with the fact 
that the produced electricity that is not consumed instantly is lost (electricity cannot be stored), 
it becomes obvious that the right planning of daily power generation must have as a result the 
avoidance of emergency situations as well as of producing much greater quantities of electricity than 
the ones needed. It must also offer the capability to electricity authorities worldwide to use as far as 
possible the low functionality cost machines for covering the electrical energy needed. As a result, it 
becomes easily understood that an accurate STLF, like the prediction of the consumption of electrical 
energy of the next day, is an important tool for good power planning. Over the years, a large 
number of methodologies have been developed to perform STLF. These methodologies are mainly 
emerged from two different paradigms, namely classical statistical techniques and computational 
intelligent techniques. In particular, classical statistical techniques include, among others, regression 
models, ARIMA time series models, Kalman filtering models, etc., see, e.g., Hyde & Hodnett 
(1997), Sargunaraj, Gupta & Devi (1997) and Huang Sz Shih (2003), while computational intelligent 
techniques include, among others, artificial neural networks, expert systems, etc., see, e.g., Rahman 
&d Hazim (1993) and AlFuhaid, El-Sayed & Mahmoud (1997). For extensive reviews on classical 
statistical techniques and computational intelligent techniques for STLF, we refer to, e.g., Hippert, 
Pedreira & Souza (2001), Alfares & Nazeeruddin (2002), Kyriakides & Polycarpou (2007) and Taylor 
& McSharry (2008). 

Here, we focus on statistical techniques for STLF. It should be stressed that STLF is commonly 
considered as a difficult task because the daily load demand is influenced by many factors, viz., weather 
conditions, holidays, weekdays, weekends, economic conditions, and, last but not least, idiosyncratic 
and social habits of individuals. Moreover, daily load demand is commonly recorded at a finite 
number of equidistance time points, viz., every half of each hour or every quarter of each hour. Thus, 
in order to forecast the load demand of next day, one has to predict the load demand at forty-eight or 
ninety-six, respectively, time points. From a statistical point of view, it is convenient to think of the 
daily load demand recorded at these forty-eight or ninety-six time points as a segment and to perform 
load prediction for the whole segment of time points rather than forecasting the load demand at each 
one of these time points separately. This implies that we adopt the functional time series framework 
in our approach, see, e.g., Bosq (2000, Chapter 9) and Ferraty & Vieu (2006, Chapter 12). 

Based on the previous discussion, we introduce below a novel functional time series methodology 
for STLF. The prediction is performed by means of a weighted average of past daily load segments, 
the shape of which is similar to the expected shape of the load segment to be predicted. The past load 
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segments are identified from the available history of the observed load segments by means of their 
closeness to a so-called reference load segment, the later being selected in a manner that captures the 
expected qualitative and quantitative characteristics of the load segment to be predicted. 

The paper is organized as follows. In Section [2j we provide some methodological background 
and review alternative functional time series methodologies that can be applied for STLF. We then 
describe the suggested functional time series forecasting methodology and establish weak consistency 
of the resulting functional predictor. As an application, we illustrate in Section [3| how the proposed 
functional predictor performs by applying it to the historical daily electricity load data in Cyprus and 
compare its performance to that of a recently proposed alternative functional time series methodology 
for STLF. Some concluding remarks are provided in Section [4} 

2 Functional Time Series Forecasting 

Putting the above discussion in a statistical methodological context, one seeks information on the 
evolution of a (real- valued) continuous-time stochastic process X = (X(t); t £ M) in the future. 
Given a trajectory (curve) of X observed on the interval [0, T], one would like to predict the behavior 
of X on the entire interval [T,T + 5], where 5 > 0, rather than at specific time-points. An appropriate 
approach to this problem is to divide the interval [0, T] into subintervals [16, (1 + 1)5], I = 0, 1, . . . , k — 1 
with k = T/8, and to consider the (function- valued) discrete-time stochastic process S = (S n ; n £ N), 
where N = {1, 2, . . .}, defined by 

S n (t) = X(t+ (n- 1)6), n G N, Vte[0,8). (1) 

For the specific STLF application in mind, where the aim is one-day ahead prediction, the 
segmentation parameter 5 corresponds to the daily electricity demand. In practice, the electricity 
demand is recorded at a finite number of equidistance time points within each day, say t±, ■ ■ ■ , tp, 
for instance, every half of each hour (viz., P = 48) or every quarter of each hour (viz., P = 96). 
Letting S n (U) to be the observation at time point tj, i = 1,2, ... ,P, within curve S n , n G N, we 
denote by 

S n = [S n (h), S n (t 2 ), . . . , S n {tp)}, IlGN, 

the segment of the total number of observations of the n-th curve S n , n G N. 

Therefore, given a 'sample' Si, S%, . . . , Sl of segments, our aim is then to predict the whole next 
segment <Sl+i, viz., to predict 

£l+i = [SL+i(h), SL+2{h), ■ ■ ■ , Sl+i(£p)]- 
2.1 Existing Approaches 

In the recent statistical literature, practically all investigations to date for the aforementioned 
functional time series prediction problem are for the case where one assumes that the underlying 
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stochastic process is driven by a Hilbert-valued, first-order, autoregressive processes, which implies 
that the best predictor, Sl+i, of curve Sl+i given its past history (the 'sample' of curves) 
S±,S2, ■ ■ ■ ,Sl, is the conditional mean of Sl+i given Sl, see, e.g., Bosq (1991), Bosq (2000, Chapter 
3). In practice, however, an appropriate version of Sl+i is obtained using some regularization on the 
predictor Sl+i of segment Sl+i- In particular, projection, spline and wavelet based regularization 
techniques have been proposed and consistency results of the resulting predictions have been obtained, 
see, e.g., Bosq (2000, Chapter 4), Besse & Cardot (1996), Pumo (1998), Besse, Cardot &; Stephenson 
(2000) and Antoniadis & Sapatinas (2003). For some recent variants of this autoregressive structure, 
see, e.g., Marion & Pumo (2004) and Mas & Pumo (2007). 

An alternative approach to this prediction problem, was proposed by Antoniadis, Paparoditis 
& Sapatinas (2006). They developed a predictor via functional kernel nonpar ametric regression 
estimation techniques, using a conditioning idea. In particular, prediction of segment Sl+i was 
obtained by kernel smoothing, conditioning on the last observed segment Sl, while the resulting 
predictor was expressed as a weighted average of the past segments, placing more weight on those 
segments the preceding of which is similar to the present one. This functional time series forecasting 
methodology is rooted in the ability to find 'similar' segments. Considering that segments can be 
sampled values of quite irregular curves, similarity matching was based on a distance metric on the 
discrete wavelet coefficients of a suitable wavelet decomposition of the available segments. For a 
similar approach, see, e.g., Ferraty, Goia & Vieu (2002) and Ferraty & Vieu (2006, Chapter 11). 

The common implicit assumption in developing the above functional time series forecasting 
methodologies is that all the available information for predicting segment Sl+i is essentially contained 
in the last observed segment, viz., segment Sl- However, it is more appropriate to assume that the 
profile of the daily electricity load demand depends, in a complicated and unknown way, on a set 
of quantitative variables, such as, daily temperature, daily humidity, daily wind speed, etc., as well 
as on a set of qualitative variables, such as, weekdays, weekends, holidays, seasonal characteristics, 
etc., of the day to be predicted. Apparently, this information is not necessarily contained in the 
behavior of the last (observed) segment Sl- Thus, functional time series forecasting approaches that 
are based on conditioning ideas on the past behavior of the observed segments can ignore important 
information concerning the segment to be predicted, which is not necessarily contained in the past 
observed segments. The numerical results we present in Section [3] clearly show that this information 
is proved valuable for accurate daily load demand forecasting. 

Recently, Ferraty et al. (2011) considered a kernel regression estimator when both the response and 
the explanatory variables are functional. Thinking of the explanatory variables being quantitative, 
such as, daily temperature, daily humidity, daily wind speed, etc., the resulting functional kernel 
regression approach could be used for STLF. However, this functional regression-based approach 
does not take appropriately into account a number of specific factors, the behavior of which turns 
out to be important for STLF. For instance, the same behavior of the daily temperature as an 
explanatory variable could lead to a different response, viz., daily load demand, depending on the 
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seasonal characteristics and on other factors, such as, weekdays, weekends, holidays, etc. This suggests 
that in order to perform accurate STLF, one needs to appropriately take into account not only the 
behavior of some quantitative variables but also on qualitative characteristics of the segment to be 
predicted, such as, weekdays, weekends, holidays, and seasonal factors, that jointly affect the daily 
load demand behavior. 

The above results in typical curves of daily load demand behavior depending on a number of 
quantitative and qualitative variables. Thus, in order to perform STLF one has to: (i) identify the 
appropriate curve of next day's load demand based on next day's behavior of these variables, and 
(ii) find in the entire time series history those curves that are similar to the identified one. With 
this in mind, we introduce below an alternative functional time series forecasting methodology for 
daily electricity load demand which performs the prediction by means of a weighted average of past 
segments, the shape of which is similar to the expected shape of the segment to be predicted. The past 
segments are identified from the entire history of the observed segments by means of their closeness 
to a so-called reference segment, the later being selected in a manner that captures the expected 
qualitative and quantitative characteristics of the segment to be predicted. Since the proposed STLF 
methodology is looking at the entire past for 'shapes' that are similar to the expected 'shape' of the 
day to be predicted, the resulting predictor is called the similar shape functional time series predictor 
(SSP). 

2.2 The Similar Shape Functional Time Series Predictor 

Based on the previous discussion, it is not unrealistic to assume that daily load demand depends on 
several quantitative factors, such as, daily temperature, daily humidity, daily wind speed, etc., and 
qualitative variables, such as, weekdays, weekends, holidays, season characteristics, etc. Suppose that 
these quantitative and qualitative characteristics result in a typical daily load demand shape, so that 
each curve S n can be expressed as 

M 

Sn{t) = fm(Tn(t),n n (t),W n (t),...)I(g n £ Gm) + £n{t), n G N, i G [0, 6), (2) 
m=l 

where 

• M is an unknown number of typical daily load demand shapes described by the unknown 
functions f m that correspond to different values of Q m , m = 1, 2, . . . , M. 

• Q rn = (Gi j7n , Q2,mi ■ ■ ■) denotes a group of deterministic variables, denoting qualitative 
characteristics, e.g., Q\, m refers to grouping of days (weekdays, weekends, holidays), Q2, m to 
the season characteristics, etc. 

(Here, and in what follows, 1(A) denotes the indicator function of the set A.) 

Remark 2.1. Notice that, in order to identify the seasonal characteristics, we do not only 
take into account the 'global seasonality', as this is described by autumn, winter, spring and 
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summer, but also the 'local seasonality', which refers to the weather conditions during the very 
recent past and which affect the behavior of the daily load demand. This is important since, 
for instance, a period of warm days in the winter causes a different behavior of the daily load 
demand compared to the one caused by a period of similar warm days in the spring or in 
the summer. This local seasonality aspects are also one of the reasons why regression based 
approaches are not very appropriate in this context. For instance, the same value of the daily 
temperature (the explanatory variable) may cause a different daily load demand (the response 
variable), depending on these local seasonal characteristics. 

• 9n = (di,n,92,n, ■ ■ •) denotes the qualitative characteristics of curve S n , n G N, e.g., gi >n refers 
to the group membership (weekdays, weekends, holidays), <?2,n refers to the season membership 
(autumn, winter, spring, summer), etc. 

• T n (t), T-L n (t), W n (t), . . . are exogenous random variables, denoting quantitative characteristics, 
e.g., T n (t) is the daily temperature curve, % n (t) is the daily humidity curve, W n {t) is the daily 
wind speed curve, etc. 

Notice that these curves are function- valued random variables, usually called functional random 
variables in the statistical literature. In other words, they are random variables defined on a 
common probability space with values in an infinite-dimensional metric space. 

• (e n (t), n G N) is a C([0, <5))-valued strong Gaussian white noise, viz., a sequence of independent 
and identically distributed (i.i.d) C([0, £))-valued Gaussian random variables, where C([0, 8)) is 
the space of continuous functions defined on the interval [0,(5). Furthermore, it is assumed that 
E(e n ) = and E(||e n || 2 ) < oo. (Note that, in this case, the errors e n (ij), i = 1, 2, . . . , P, forms 
a sequence of i.i.d. Gaussian random variables with zero mean and finite variance.) It is also 
assumed that e n {t) is independent of T s (t), Us(t), W s {t), . . . for all s < n, n G N. 

In what follows, for simplicity and data availability issues, we restrict our attention to the daily 
temperature curve 7~ n , n G N, which is one of the main exogenous functional random variables affecting 
daily load demand, that is, we consider the following model 

M 

Sn{t) = fm{T n (t)) I(g n G Q m ) + e n (t), n G N, t G [0, 5). (3) 

m=l 

(However, we note that the suggested methodology can be straightforwardly modified to any available 
number of exogenous functional random variables.) Let Tx, + i(tj) to be the observation at time point 
U, i = 1, 2, . . . , P, within curve Tl+i, viz., we denote by 

Tl+i = P~L+i(ii), TL + i(t 2 ), . . . , Ti + i(tp)] 

the segment of the total number of observations of the (L + l)-th curve Tl+i- 
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Given model ^ and based on the 'sample' S±, S2, ■ ■ ■ , Sl of segments, the suggested prediction 
procedure consists of the following two main steps: (a) identify among the M typical daily load 
demand shapes the one which is appropriate for the prediction of segment Sl+i, depending on 
the quantitative and qualitative characteristics of the segment to be predicted, and (b) obtain the 
prediction as a weighted average of all past segments by giving more weight to the segments the shape 
of which is more similar to the identified shape (obtained in (a)) of the segment to be predicted. 

The following algorithm describes in more detail how to construct the suggested SSP, Sl+i, of 
segment Sl+i. In particular, Steps 1 and 2 sort out the problem of identifying the appropriate shape 
of the segment to be predicted described in (a) while Step 3 refers to the calculation of the predictor 
described in (b). 

Step 1. Specify the group membership Qi jm and the local season membership Q2, m , where segment Sl+i 
belongs to. While specifying the group membership is easily done based on the particular day to 
be predicted (weekdays, weekends, holidays), the specification of the 'local seasonality' is more 
difficult and rather arbitrary. This is so, since, as explained before, local seasonality depends 
on the specific seasonal and weather characteristics of the very recent past of the time series 
and their stability. In our approach, local seasonality is essential and is taken into account by 
selecting a small number ul of past segments that are further considered for selecting what it 
is called the typical shape or reference segment. This is done in Step 2 of the algorithm. 

Step 2. Determine a relevant load profile by specifying a so-called reference segment, S^ Re \ as follows: 

• Find among the last ul segments those belonging to same group memberships Gi, m , 
m £ {1,2, ... , M} as segment Sl+i- Let Cl+i be the set of the selected segments, viz., 

Cl+i = {Si : gij = Gi, m , l<l< n L }. (4) 

Notice that the length of local seasonality and its stability is controlled by the parameter 
ul which determines how far in the past one goes to select a possible set of segments for 
specifying the reference segment S^^ . 

• Let Tl+i be a predictor of segment Tl+i, which should not necessarily be available on the 
entire set of time points {t%, t%, . . . , tp} (see below). 

• Let T> be any of the (equivalent) distances in R p . Then, the reference segment S^^ is 
obtained as 

s{Re){ti) = uk ^ Sliu) > i=1 > 2 '---> p > ( 5 ) 

where 

C* = {S t e Cl+i : I{V{T h T L +i) < 5)} 

and 

5 := 5(L) > mm {leCL+l} {V(T u f L +i)}. (6) 
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Notice that the reference segment S^ Re ^ is obtained as a simple average of those segments 
Si £ Cl+i belonging to the same group and having the same local seasonal characteristics, 
the temperature segments 1), I = 1,2, ... ,n,L, of which are close enough to the predicted 
temperature Tl+x of the segment Sl+i to be predicted. This 'closeness' is controlled by the 
parameter 5. We also point out (and this is important for practical applications) that it is 
not necessary to have the prediction Tl +1 on the entire set of time points {ii, t2, ■ ■ ■ , tp} to 
specify the set C* . In fact, one can compare the temperature segments T\ and Tl+\ using 
only the subset of time points on which the predictions for the segment Tl+i are available 
(or provided by other sources). 

Step 3. Finally, the SSP, Sl+i, of the segment Sl+i, is given by 

L 

S L+1 (U)=J2 W rS r (U), i = l,2,...,P, (7) 
r=l 

where the weights w r = w(S r , S^ Re ^), r = 1,2, ... ,L, satisfy w r > 0, r = 1,2, ... ,L, and 
^2r=i Wr = Following the nonparametric literature, the weights w r , r = 1,2, ... ,L, are 
chosen as 



^= ^ T;::: JL^ r=i,2,..., L , (8 ) 



K h (V(S r ,S^)) 

with Kh(-) = h~ 1 K(-/h) for some kernel function K, bandwidth and distance measure T> 
between segments. It is assumed below that K is a compactly supported bounded symmetric 
density such that f u 2 K(u)du < oo, and that h = h(L) — > as L — > oo. 

Remark 2.2. Notice that the SSP Sl+i is obtained as a weighted average of past segments, where 
more weight w r , r = 1, 2, . . . , L, is placed on the segment the shape of which is similar (in terms of the 
particular distance T> used) to the shape of the reference segment S^^ . This clarifies the differences 
between the suggested approach and several other approaches proposed in the literature that are 
based on conditioning ideas. In the case of conditioning on the last observed segment Sl (see, e.g., 
Antoniadis, Paparoditis & Sapatinas (2006)), the predictor is obtained as a weighted average of past 
segments, where the weight given to a segment depends on its closeness to the conditioning segment 
Sl- For the suggested approach, the role of the conditioning segment is taken over by the reference 
segment S^^ which comprehensively contains all relevant information regarding the shape of the 
daily load demand to be predicted. Thus, the selection of the reference segment S^ Re ^ is essential 



for the quality of the predictor obtained. This is stated more precisely in Lemma 5.1 which, in fact, 
shows that weak consistency of the suggested SSP is equivalent to weak consistency of the selected 
reference segment S^ Re \ 

Regarding the behavior of the smoothing parameters involved in the suggested prediction 
procedure, we assume the following. 

Assumption 2.1 
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(i) riL — > oo as L — > oo such that \C*\ — > oo, where C* is defined in Step 2. 

(ii) 5^-0 such that \C*\6 ->■ oo. 

(iii) /i — )• such that /iL — )• oo. 

Assumption 2.1 (iii) is standard for weak consistency in non-parametric kernel estimation. Assumption 
2.1 (i) requires that the number of segments taken into account to calculate the reference segment 

S (Re) 

viz., the number of segments belonging to the set C* , grows as the sample size increases. 
Assumption 2.1(h) requires that the bandwidth 5, used for obtaining the reference segment S^ Re \ goes 
to zero in such a way that the number of segments effectively used in calculating S^ Re \ viz., |C*|<5, 
increases to infinity, which is also as standard assumption for weak consistency in non-parametric 
kernel estimation. 

The following theorem establish the weak consistency of the suggested SSP. 

Theorem 2.1. Assume model Q) and that Assumption 2.1 is satisfied. Let Sl+i be defined by |?|) 
and assume that 

Sl+1 = fm(TL+l) + SL+1, (9) 

for some m £ {1,2,..., M}, where 

f m (T L+ i) = [f m (T L+1 (t 1 ))J m (T L+l (t 2 )),...J rn (T L+l (t P ))} 

and 
Then 

V(S L+1 , f m (T L+1 ) 4 0, as L -> oo, (10) 

V 

where — > denotes convergence in probability. 

3 Application: EAC Daily Load Data 
3.1 Description of the Data Set 

Electricity Authority of Cyprus (EAC) is the organization that is responsible for the generation, 
transmission and distribution of electricity in Cyprus. The target of EAC is to provide Cypriots with 
high quality of safe and reliable services and activities at competitive prices. EAC uses two types 
of machines to produce electricity. The first type of machine is a steam turbine that uses dynamic 
pressure generated by expanding steam to turn the blades of a turbine. Almost all large non-hydro 
plants use this system. About 80% of all electric power produced in the world is by use of steam 
turbine. The advantages of using such a type of machine are the high overall cogeneration efficiencies 
of up to 80%, the wide range of possible fuels including waste fuel and biomass, the production of 
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high temperature/pressure steam and the established technology. On the contrary, we can mark the 
low electrical efficiencies, the slow start up times, the poor part load performance and especially 
the need for expensive high-pressure boilers and other equipment. The second type of machine is a 
diesel engine which uses an electrical generator. Diesel generating sets are used in places without 
connection to the power grid, as emergency power-supply if the grid fails. Of course, they are widely 
used not only for emergency power but also many of them have a secondary function of feeding power 
to utility grids either during peak periods or during periods with a shortage of large power generators. 
Although, we may say that the cost of their functionality is forbidding. 

The best planning for EAC is to avoid these emergency situations, use machines which have low 
functionality cost and high electrical efficiencies for electricity generation and distribution, which 
would cover the needs of the whole island. The most important characteristic of the right plan should 
be the fact that the quantity of electricity produced must not be greater than one needs since the 
additional electricity produced can not be stored and is lost. An extremely useful tool for the right 
plan is the accurate prediction of the consumption of the electrical energy of the next day. 



Below, we apply the suggested SSP methodology proposed in Subsection |2.2| to a set of daily load 
data that were provided by EAC, concerning the electrical energy consumption, in megawatts (MW), 
per fifteen minute intervals, viz., P = 96, for the period from 01/01/2007 to 31/12/2010. The data 



set is displayed in Figure 5.1 From this figure, a slightly upward trend can be observed along with a 
strong periodic component within each year. It is also evident that the electrical energy consumption 
slightly increases every year and during the summer months attains its maximum. Since the goal 
is to predict the daily shape of electrical energy consumption, and not the overall trend, the EAC 
rescales daily curves by dividing them by their maximum value. This leads to daily shape curves 
that vary between zero and one. The aim is then to produce accurate predictions of the shape of the 
rescaled consumption of next day's electrical energy. We then transform the predictor to the original 
scale, by multiplying the resulting SSP by the maximum value of the electrical load of the day to be 
predicted, provided by EAC. In the next section, we demonstrate how the SSP methodology proposed 



in Subsection 2.2, can be implemented to fulfill this aim. 



3.2 Implementation of the SSP 

In our context, the curves S n , n = 1,2, ... ,L, that are derived from equation Q, coincide with 
the calendar days from the 1st of January 2007 up to the last day Sl, from which observations are 
available. Hence, Sl+i coincide with the day for which prediction is required. Based on the 'sample' 
Si, $2) • • • j Sl of segments, the goal is to specify the SSP Sl+i- To this end, the following steps are 
taken: 

1. The group memberships Gi,mi m & {1, 2, ... , M}, are specified, where the segment Sl+i belongs 
to. Feedback from the EAC, have shown that an appropriate grouping of days, with similar 
shape behaviour based on some national characteristics, is the following: 
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Group I : Monday, Tuesday, Thursday, Friday 
Group II : Wednesday 
Group III : Saturday 
Group IV : Sunday 

2. To determine the reference segment S^ Re \ as mentioned previously, we restrict our attention only 
to the exogenous random variable T n (t), n € N, that denotes the daily temperature segment. 
We then find all days belonging to same group memberships Gi >m , m £ {1, 2, . . . , M}, as the 



segment we want to predict. The parameter til, described in Subsection 2^ is set equal to 
til = 14 if the segment to be predicted corresponds to Monday, Tuesday, Thursday or Friday 
and is set equal to ul = 28 if the segment to be predicted corresponds to Wednesday, Saturday 
or Sunday. This seems to be appropriate in order to have enough information to select the 
reference segment while at the same time retains local seasonality. Regarding the exogenous 
random variable T n (t),n E N, we use a predictor Tl+i of Tl+i- As mentioned earlier, it is not 
necessary to have the prediction Tl+i on the entire set of time points {t\, t2, ■ ■ ■ , tp}, P = 96, 
to specify the set C* . In fact, one can compare the temperature segments I), I = 1,2, .. . ,ul, 
and Tl+i using only a subset of time points on which the predictions for the segment Tl+i are 
available (or provided by other sources). More specifically, we have used predictions of next 
day's temperature at only four time points, that is, those corresponding to 08:00, 12:00, 16:00 
and 20:00. The actual temperatures predictions of segment Tl+i at these time points were 
provided to us by EAC. 



3. To further simplify the selection of the reference segment S^ Re \ we set in Q 

5 = min {leCL+l} {V(Ti,f L +i)}, 
that is, the reference segment S^ Re ^ is obtained as 

S (Re) = argmin{5;gCL+i}{ p ( r,,f L+1 )}, (11) 

where Cl+i is given by Q. 



4. The SSP Sl+i is obtained by ([7j), where the weights w r , r = 1, 2, . . . , L, are determined by 
with the kernel function K being the Gaussian kernel and the bandwidth h being selected by 
the empirical risk of prediction methodology suggested by Antoniadis, Paparoditis & Sapatinas 
(2009). 

Finally, the computational algorithm related to the above implementation as well as the overall 



numerical study presented in Subection 3.3 has been carried out in the Matlab 7.7.0 programming 
environment. 
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3.3 Numerical Results 



Based on the above implementation, we apply the SSP functional time series forecasting methodology 
to a randomly selected number of days within the year 2010, from the dataset displayed in 



Figure 5.1, and compare its performance to the recently established wavelet-kernel functional time 
series methodology (WKP) proposed by Antoniadis, Paparoditis & Sapatinas (2006). We restrict 
our comparison to that forecasting methodology, since as it has been demonstrated by these authors 
in a number of simulated and real-data examples, in terms of functional time series forecasting, 
WKP outperforms many well-established forecasting methods, like a wavelet regularization method, 
a smoothing spline method, the classical SARIMA model and the Holt- Winters forecasting procedure. 
We refer to Antoniadis, Paparoditis & Sapatinas (2006) for more details. 

The quality of both SSP and WKP were measured by the relative mean-absolute error (RMAE) 
defined as 

RMAE = if \ § l + i(U) - Sl +1 (U)\ 

96^- S L+1 (U) 
i=i 

For both SSP and WKP, we also report the Maximum Difference (MaxDiff ) and Minimum Difference 
(MinDiff ) defined as 

MaxDiff = max {S L+1 (ti) - S L+1 (ti)} and MinDiff = min \S L+1 (ti) - S L +i(U)}. 

i=l,2,. ..,96 i=l,2,...,96 



It is evident from the analysis (see Figures 5.2 - 5.6 and Table 5.1 ) that the SSP clearly outperforms 



the WKP. It terms of RMAE, only in 5 out of the 30 randomly selected number of days the SSP 
performs slightly worse than the WKP. Furthermore, in a large number of days, the RMAE of the 



WKP considerably exceeds the RMAE of the SSP, as it is clearly seen in Figure 5.2 Looking at each 



day separately (see Figures 5.3-5.6), the SSP curves are quite close to the actual load curve while at 
the same time there are days where the WKP fails to appropriately capture even the overall behavior 
of the latter curve (see, e.g., 26 Jan 2010, 1 Aug 2010, 2 Oct 2010 and 12 Dec 2010). 



4 Conclusions 

We introduced a novel functional time series methodology for short-term load forecasting, named the 
functional similar shape time series predictor. The predictor was performed by means of a weighted 
average of past daily load segments, the shape of which is similar to the expected shape of the load 
segment to be predicted. To quantify this similarity, the notion of reference segment was introduced 
which captures the expected qualitative and quantitative characteristics of the load segment to be 
predicted. The functional similar shape predictor was theoretically justified by proving a weak 
consistency property. Furthermore, its usefulness for short-term load forecasting was demonstrated 
by applying it to historical daily load data in Cyprus. The numerical results obtained showed that the 
functional similar shape predictor works very satisfactory and outperforrms the functional wavelet- 
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kernel time series predictor, a recently established alternative functional time series methodology for 
short-term load forecasting. 

We note that, although for simplicity and data availability issues, we restricted our attention to 
the daily temperature, which is one of the main exogenous functional random variables, the suggested 
functional time series methodology for short-term load forecasting can be also modified to take into 
account other daily exogenous functional random variables, like humidity, wind speed, sunshine, etc., 
which might affect daily load demand. Although the above modification is straightforward from a 
theoretical point of view, its practical implementation depends on the availability of the required time 
series data. 



5 Appendix: Proof of Theorem 2.1 



To prove Theorem 2.1, we first prove the following auxiliary result, showing that weak consistency of 
the suggested SSP Sl+x, defined by (Fh, is equivalent to the weak consistency of the selected reference 
segment S( Re \ defined by dot 



Lemma 5.1. Assume model that Assumption 2.1 {in) is satisfied, and that holds true. Let 
g(Re) an d § L+1 be defined by and |?|) ; respectively. Then, (10) holds true, if and only if 



V{S^ Re \ f m {T L+l )) 4 0, as L -> oo. 
Proof. We first prove the "if" part of the lemma. Assume that ( |12[ ) holds true. Since 
V(S L+1 J m (T L+ i)) < T>(S L+1 , S^) + V(S( Re \ f m (T L+ x)) 



(12) 



:=A L + B L . 



Since 



A L < J2w r V(S r ,S {Re) ) = O p (h) -»• and B L ^0, as L -> oo 



r=X 



the "if" part of the lemma follows. 



We now prove the "only if" part of the lemma. Assume that (10) holds true. Note that, for each 
t € {tx, Hi ■ ■ ■ ) tp}, 

L 



\S L+1 (t) - S( Re \t)\ <Y,^r \S r (t) ~ S( Re Ht)\ = O p (h). 



r=l 



Also, 



V{S^ Re \ f m (T L+1 )) < V(S( Re \S L+1 ) + V(S L+1 , f m {T L+l )) 
■ = C L + D L . 



Since 



V 



Cl = O p (h) and Dl — > 0, as L — > oo, 
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the "only if" part of the lemma follows. 



□ 



We are now ready to prove Theorem 2.1 In view of Lemma 5.1 it suffices to prove From (J, 
it is easily seen that S*-**) has the expression of a non-parametric estimator with a uniform kernel. 



Thus, in view of Assumption 2.1 (i) and (u), (10) holds true by standard weak consistency arguments 
for functional non-parametric kernel estimators, see, e.g., Bosq (1998, Chapter 3). □ 
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Figure 5.1: Electrical power consumption in Cyprus between 1st January 2007 and 1st January 2010, 
recored every fifteen minutes. 
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Figure 5.2: RMAE for the Functional Similar Shape Time Series Predictor (SSP, solid line) and the 
Functional Wavelet-Kernel Time Series Predictor (WKP, dotted line) for the 30 randomly selected 
days within the year 2010. 
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Date 


RMAE 


SSP 

MaxDiff 


MinDiff 


RMAE 


WKP 
MaxDiff 


MinDiff 


12 Jan 2010 (Wed) 





0120 


21.735 


-21.350 





0344 


13.010 


-62.318 


26 Jan 2010 (Wed) 





0183 


31.771 


-56.341 





0774 


16.698 


-168.246 


10 Feb 2010 (Thu) 





0244 


20.679 


-40.814 





0355 


48.139 


-7.649 


22 Feb 2010 (Tue) 





0139 


19.484 


-25.830 





0269 


41.484 


-38.054 


9 Mar 2010 (Wed) 





0136 


17.372 


-14.837 





0220 


27.278 


-43.017 


30 Mar 2010 (Wed) 





0085 


17.398 


-17.925 





0177 


21.790 


-18.634 


8 Apr 2010 (Fri) 





0174 


23.191 


-12.960 





0194 


60.346 


-29.092 


9 Apr 2010 (Sat) 





0116 


17.667 


-11.217 





0170 


19.972 


-17.742 


26 Apr 2010 (Tue) 





0107 


11.446 


-16.421 





0195 


15.724 


-54.782 


8 May 2010 (Sun) 





0143 


12.152 


-17.117 





0110 


11.436 


-29.488 


17 May 2010 (Tue) 





0227 


42.119 


-18.699 





0496 


69.632 


-7.392 


5 Jun 2010 (Sun) 





0079 


8, .261 


-12.915 





0075 


12.039 


-7.439 


9 Jun 2010 (Thu) 





0168 


23.480 


-19.095 





0201 


10.070 


-32.029 


10 Jun 2010 (Fri) 





0183 


46.482 


-9.387 





0251 


50.860 


-25.818 


22 Jun 2010 (Wed) 





0397 


46.887 


-74.603 





0398 


72.108 


0.0328 


7 Jul 2010 (Thu) 





0209 


19.830 


-36.639 





0117 


25.420 


-16.504 


25 Jul 2010 (Mon) 





0076 


16.212 


-12.496 





0089 


8.709 


-19.766 


1 Aug 2010 (Mon) 





0188 


29.011 


-37.558 





0546 


-83.500 


-117.146 


16 Aug 2010 (Tue) 





0359 


21.442 


-79.744 





0320 


68.523 


-16.523 


25 Aug 2010 (Thu) 





0264 


45.570 


-31.062 





0431 


72.600 


11.020 


12 Sep 2010 (Mon) 





0151 


30.734 


-31.385 





0817 


96.777 


19.450 


17 Sep 2010 (Sat) 





0135 


22.667 


-18.267 





0452 


66.735 


-0.768 


30 Sep 2010 (Fri) 





0207 


35.214 


-17.545 





0208 


18.353 


-47.553 


2 Oct 2010 (Sun) 





0119 


20.838 


-37.730 





0514 


105.903 


-58.231 


17 Oct 2010 (Mon) 





0120 


2.152 


-17.166 





0131 


17.502 


-21.257 


2 Nov 2010 (Wed) 





0096 


9.516 


-16.643 





0223 


11.081 


-28.035 


11 Nov 2010 (Fri) 





0091 


19.189 


-11.575 





0095 


11.235 


-27.899 


1 Dec 2010 (Thu) 





0117 


17.036 


-11.819 





0093 


9.633 


-24.258 


iz uec zuiu ^ivionj 





0752 


04. 400 


CO ceo 
-DO.OOO 





10617 


o.uiy 


-izo.zuy 


27 Dec 2010 (Tue) 





0151 


40.785 


-14.061 





0289 


25.129 


-51.986 



Table 5.1: RMAE, MaxDiff and MinDiff for the Functional Similar Shape Time Series Predictor 
(SSP) and the Functional Wavelet-Kernel Time Series Predictor (WKP) for the 30 randomly selected 
days within the year 2010. 
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Figure 5.3: Actual (solid line) and predicted load using the Functional Similar Shape Time Series 
Predictor (SSP, dashed line) and the Functional Wavelet-Kernel Time Series Predictor (WKP, dotted 
line) for the randomly selected days within the year 2010. 
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Figure 5.4: Actual (solid line) and predicted load using the Functional Similar Shape Time Series 
Predictor (SSP, dashed line) and the Functional Wavelet-Kernel Time Series Predictor (WKP, dotted 
line) for the randomly selected days within the year 2010. 
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Figure 5.5: Actual (solid line) and predicted load using the Functional Similar Shape Time Series 
Predictor (SSP, dashed line) and the Functional Wavelet-Kernel Time Series Predictor (WKP, dotted 
line) for the randomly selected days within the year 2010. 
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Figure 5.6: Actual (solid line) and predicted load using the Functional Similar Shape Time Series 
Predictor (SSP, dashed line) and the Functional Wavelet-Kernel Time Series Predictor (WKP, dotted 
line) for the randomly selected days within the year 2010. 
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